Searching for Ground Truth: A Stepping Stone in Automating Genre Classification
نویسندگان
چکیده
This paper examines genre classification of documents and its role in enabling the effective automated management of digital documents by digital libraries and other repositories. We have previously presented genre classification as a valuable step toward achieving automated extraction of descriptive metadata for digital material. Here, we present results from experiments using human labellers, conducted to assist in genre characterisation and the prediction of obstacles which need to be overcome by an automated system, and to contribute to the process of creating a solid testbed corpus for extending automated genre classification and testing metadata extraction tools across genres. We also describe the performance of two classifiers based on image and stylistic modeling features in labelling the data resulting from the agreement of three human labellers across fifteen genre classes.
منابع مشابه
Music Genre Classification Revisited: An In-Depth Examination Guided by Music Experts
Despite their many identified shortcomings, music genres are still often used as ground truth and as a proxy for music similarity. In this work we therefore take another in-depth look at genre classification, this time with the help of music experts. In comparison to existing work, we aim at including the viewpoint of different stakeholders to investigate whether musicians and end-user music ta...
متن کاملGenre Classification based on Predominant Melodic Pitch Contours
We present an automatic genre classification system based on melodic features. First a ground truth genre dataset composed of polyphonic music excerpts is compiled. Predominant melodic pitch contours are then estimated, from which a series of descriptors is extracted. These features are related to melody pitch, variation and expressiveness (e.g. vibrato characteristics, pitch distributions, con...
متن کاملA Comparison of Human, Automatic and Collaborative Music Genre Classification and User Centric Evaluation of Genre Classification Systems
In this paper two sets of evaluation experiments are conducted. First, we compare state-of-the-art automatic music genre classification algorithms to human performance on the same dataset, via a listening experiment. This will show that the improvements of contentbased systems over the last years have reduced the gap between automatic and human classification performance, but could not yet clos...
متن کاملCross-Collection Evaluation for Music Classification Tasks
Many studies in music classification are concerned with obtaining the highest possible cross-validation result. However, some studies have noted that cross-validation may be prone to biases and that additional evaluations based on independent out-of-sample data are desirable. In this paper we present a methodology and software tools for cross-collection evaluation for music classification tasks...
متن کاملA System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models
We describe a system for automatic chord transcription from the raw audio using genre-specific hidden Markov models trained on audio-from-symbolic data. In order to avoid enormous amount of human labor required to manually annotate the chord labels for ground-truth, we use symbolic data such as MIDI files to automate the labeling process. In parallel, we synthesize the same symbolic files to pr...
متن کامل